Generation for Multilingual MT
نویسندگان
چکیده
This paper presents an overview of the broad-coverage, application-independent natural language generation component of the NLP system being developed at Microsoft Research. It demonstrates how this component functions within a multilingual Machine Translation system (MSR-MT), using the languages that we are currently working on (English, Spanish, Japanese, and Chinese). Section 1 provides a system description of MSR-MT. Section 2 focuses on the generation component and its set of core rules. Section 3 describes an additional layer of generation rules with examples that address issues specific to MT. Section 4 presents evaluation results in the context of MSR-MT. Section 5 addresses generation issues outside of MT.
منابع مشابه
A Framework for MT and Multilingual NLG Systems Based on Uniform Lexico-Structural Processing
In this paper we describe an implemented framework for developing monolingual or multilingual natural language generation (NLG) applications and machine translation (MT) applications. The framework demonstrates a uniform approach to generation and transfer based on declarative lexico-structural transformations of dependency structures of syntactic or conceptual levels ("uniform lexico-structura...
متن کاملReusing MT Components in Natural Language Generation for Dialogue Systems∗
This paper describes ongoing research on NLG for dialogue systems. Sentence planning is performed by selecting the appropriate template, while previously developed transfer and generation components of a transfer–based MT architecture perform the lexicalization and linguistic realization processes in the generation process. This approach allows for multilingual generation since there is a clear...
متن کاملMultilingual Sentence Generation
This paper presents an overview of a robust, broad-coverage, and application-independent natural language generation system. It demonstrates how the different language generation components function within a multilingual Machine Translation (MT) system, using the languages that we are currently working on (English, Spanish, Japanese, and Chinese). Section 1 provides a system description. Sectio...
متن کاملIntentions, Rhetoric, Or Discourse Relations? A Case From Multilingual Document Generation
The TECHDOC system [R5sner, Stede 92b] is an implemented prototype that starts from a domain knowledge base about maintenance plans, objects and actions involved, potential hazards etc. and delivers simultaneously generated instruction texts in at the moment three supported languages (English, German and French). Our approach is best characterized as an at tempt to demonstrate the feasibility o...
متن کاملITRI - 98 - 05 Multilingual authoring using feedback texts Richard Power and Donia
There are obvious reasons for trying to automate the production of multilingual documentation, especially for routine subjectmatter in restricted domains (e.g. technical instructions). Two approaches have been adopted: Machine Translation (MT) of a source text, and Multilingual Natural Language Generation (M-NLG) from a knowledge base. For MT, information extraction is a major difficulty, since...
متن کامل